High-resolution genetic mapping of maize pan-genome sequence anchors
Abstract
In addition to single-nucleotide polymorphisms, structural variation is abundant in many plant genomes. The structural variation across a species can be represented by a 'pan-genome', which is essential to fully understand the genetic control of phenotypes. However, the pan-genome's complexity hinders its accurate assembly via sequence alignment. Here we demonstrate an approach to facilitate pan-genome construction in maize. By performing 18 trillion association tests, we map 26 million tags generated by reduced representation sequencing of 14,129 maize inbred lines. Using machine-learning models, we select 4.4 million accurately mapped tags as sequence anchors, 1.1 million of which are presence/absence variations. Structural variations exhibit enriched association with phenotypic traits, indicating that they are a significant source of adaptive variation in maize. The ability to efficiently map ultrahigh-density pan-genome sequence anchors enables fine characterization of structural variation and will advance both genetic research and breeding in many crops.
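The idea of mapping a tag by association can be illustrated with a small sketch. It assumes, purely for illustration, that each tag is scored as present (1) or absent (0) in every inbred line, that each already-positioned marker is scored as a 0/1 allele, and that a two-sided Fisher exact test is used; the abstract does not say which test statistic the authors applied, and the names `fisher_two_sided`, `map_tag`, and the marker labels are hypothetical. By the abstract's own figures, 18 trillion tests over 26 million tags works out to roughly 700,000 tests per tag.

```python
from math import comb


def fisher_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))


def map_tag(tag_presence, marker_genotypes):
    """Return (marker_name, p_value) of the marker most associated with the tag.

    tag_presence: list of 0/1 values, one per inbred line (hypothetical encoding).
    marker_genotypes: dict of marker name -> list of 0/1 alleles, same line order.
    """
    best = None
    for name, alleles in marker_genotypes.items():
        pairs = list(zip(tag_presence, alleles))
        a = sum(1 for t, g in pairs if t == 1 and g == 1)
        b = sum(1 for t, g in pairs if t == 1 and g == 0)
        c = sum(1 for t, g in pairs if t == 0 and g == 1)
        d = sum(1 for t, g in pairs if t == 0 and g == 0)
        p = fisher_two_sided(a, b, c, d)
        if best is None or p < best[1]:
            best = (name, p)
    return best


# Toy data: eight lines, two markers with made-up positions.
tag = [1, 1, 1, 1, 0, 0, 0, 0]
markers = {
    "chr1_100": [1, 1, 1, 1, 0, 0, 0, 0],  # perfectly tracks the tag
    "chr2_200": [1, 0, 1, 0, 1, 0, 1, 0],  # unrelated to the tag
}
best = map_tag(tag, markers)
print(best[0])  # chr1_100
```

In this toy case the tag is assigned to the position of the marker with the smallest p-value. A real pipeline would also need multiple-testing control and a way to judge mapping accuracy, which the abstract attributes to machine-learning models.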
Similar resources
A Single Molecule Scaffold for the Maize Genome
About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome...
Genetic association mapping and genome organization of maize.
Association mapping, a high-resolution method for mapping quantitative trait loci based on linkage disequilibrium, holds great promise for the dissection of complex genetic traits. The recent assembly and characterization of maize association mapping panels, development of improved statistical methods, and successful association of candidate genes have begun to realize the power of candidate-ge...
Panzea: a database and resource for molecular and functional diversity in the maize genome
Serving as a community resource, Panzea (http://www.panzea.org) is the bioinformatics arm of the Molecular and Functional Diversity in the Maize Genome project. Maize, a classical model for genetic studies, is an important crop species and also the most diverse crop species known. On average, two randomly chosen maize lines have one single-nucleotide polymorphism every approximately 100 bp; thi...
Analysis of the epistatic and QTL×environments interaction effects of plant height in maize (Zea mays L.)
A genetic map containing 103 microsatellite loci and 200 F2 plants derived from the cross R15 × Ye478 were used for mapping of quantitative trait loci (QTL) in maize (Zea mays L.). QTLs were characterized in a population of 200 F2:4 lines, derived from selfing the F2 plants, and were evaluated with two replications in two environments. QTL mapping analysis of plant height was performed by using...
Insights into the maize pan-genome and pan-transcriptome.
Genomes at the species level are dynamic, with genes present in every individual (core) and genes in a subset of individuals (dispensable) that collectively constitute the pan-genome. Using transcriptome sequencing of seedling RNA from 503 maize (Zea mays) inbred lines to characterize the maize pan-genome, we identified 8681 representative transcript assemblies (RTAs) with 16.4% expressed in al...